Averages, distributions and scalability of MPI communication times for ethernet and myrinet networks
نویسندگان
چکیده
Most modern parallel computers are clusters using Myrinet or Ethernet communication networks. Several studies have been published comparing the performance of these two networks for parallel computing, however these focus on average performance, and do not address the distributions of communication times, which can have long tails due to contention effects. In the case of Ethernet with TCP, retransmit timeouts (RTOs) can also occur. Slow communication events may have significant impact, particularly for applications requiring frequent synchronization, where the performance is determined by the slowest process. We have analysed the distributions of communication times for standard MPI routines on Ethernet with TCP and Myrinet with GM communications networks on the same cluster, and studied the scalability of the distributions as the number of communicating processes is increased, and the effect of RTOs for Ethernet with TCP.
منابع مشابه
Performance and scalability of MPI on PC clusters
Abstract. The purpose of this paper is to compare the communication performance and scalability of MPI communication routines on an NT cluster, a Myrinet Linux cluster, an Ethernet Linux cluster, a Cray T3E-600, and an SGI Origin 2000. All tests in this paper were run for the various numbers of processors and 2 message sizes. For most of the MPI tests used in this paper, the T3E-600 and Origin ...
متن کاملImproving the Performance of MPI Collective Communication on Switched Networks
In this paper, we present new algorithms for improving the performance of collective communication operations in MPI. Our target architecture is a cluster of machines connected by a switched network such as Myrinet or switched ethernet. We have developed new algorithms for all the MPI collective communication operations, namely, scatter/gather/reduce, allgather/allreduce, broadcast, reduce-scat...
متن کاملMPICH/Madeleine: a True Multi-Protocol MPI for High Performance Networks
This paper introduces a version of MPICH handling efficiently different networks simultaneously. The core of the implementation relies on a device called ch mad which is based on a generic multiprotocol communication library called Madeleine. The performance achieved with tested networks such as Fast-Ethernet, Scalable Coherent Interface or Myrinet is very good. Indeed, this multi-protocol vers...
متن کاملThe Performance of MPI Derived Types on a SGI Origin 2000 , aCray T 3 E - 900 , a Myrinet Linux Cluster and an Ethernet
This paper compares the performance of MPI derived types with user packing on a SGI Origin 2000, a Cray T3E-900, a Myrinet Linux cluster and an Ethernet Linux cluster. Four communication tests using MPI derived type routines and packing/unpacking techniques are run for a variety of message sizes using four processors on these machines. Except for one test, MPI derived types outperform user pack...
متن کاملAnalytical Models of Probability Distributions for MPI Point-to-Point Communication Times on Distributed Memory Parallel Computers
Measurement and modelling of distributions of data communication times is commonly done for telecommunication networks, but this has not previously been done for message passing communications on parallel computers. We have used the MPIBench program to measure distributions of point-to-point MPI communication times for two different parallel computers, with a low-end Ethernet network and a high...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007